Speaker adaptation experiments using nonstationary-state hidden Markov models: a MAP approach

نویسندگان

  • Rathinavelu Chengalvarayan
  • Li Deng
چکیده

In this paper, we report our recent work on applications of the MAP approach to estimating the time-varying polynomial Gaussian mean functions in the nonstationary-state or trended HMM. Assuming uncorrelatedness among the polynomial coefficients in the trended HMM, we have obtained analytical results for the MAP estimates of the time-varying mean and precision parameters. We have implemented a speech recognizer based on these results in speaker adaptation experiments using TI46 corpora. Experimental results show that the trended HMM always outperforms the standard, stationary-state HMM and that adaptation of polynomial coefficients only is better than adapting both polynomial coefficients and precision matrices when fewer than four adaptation tokens are used.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A maximum a posteriori approach to speaker adaptation using the trended hidden Markov model

A formulation of the maximum a posteriori (MAP) approach to speaker adaptation is presented with use of the trended or nonstationary-state hidden Markov model (HMM), where the Gaussian means in each HMM state are characterized by time-varying polynomial trend functions of the state sojourn time. Assuming uncorrelatedness among the polynomial coefficients in the trend functions, we have obtained...

متن کامل

A Bayesian Prediction Approach to Robust Speech Recognition and Online Speaker Adaptation

Because the acoustic environments are uncertain and nonstationary, it is necessary to characterize the uncertainty of speech hidden Markov models (HMM’s) for recognition and trace the uncertainty sequentially to match the nonstationary environments. In this study, we develop a new Bayesian predictive classification (BPC) framework for robust decision and online speaker adaptation. The BPC decis...

متن کامل

Speech recognition in non-stationary adverse environments

A bshctIn this paper, we introduce a new approach, called nonstationary adaptation (NA), to recognize speech under nonstationary adverse environments. Two models are used: one is a speaker-independent hidden Markov model (HMM) for clean speech, the other is an ergodic Markov chain representing the nonstationary adverse environment. Each state in the Markov chain represents one stationary advers...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1997